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Abstract: The Tevatron experiments have recently excluded a Standard Model Higgs 



boson in the mass range 160 GeV < mn < 170 GeV at the 95% confidence level. This 
result is based on sophisticated analyses designed to maximize the ratio of accepted signal 
to background. In this paper we study the production of a Higgs boson of mass mn = 160 
GeV in the gg —* H — > WW — * lulu channel. We choose a set of cuts like those adopted in 
the experimental analysis and compare kinematical distributions of the final state leptons 
computed in NNLO QCD to lower-order calculations and to those obtained with the event 
generators PYTHIA, HERWIG and MC@NLO. We also show that the distribution of the 
output from an Artificial Neural Network obtained with the different tools does not show 
significant differences. However, the final acceptance computed with PYTHIA is smaller 
than those obtained at NNLO and with HERWIG and MC@NLO. We also investigate the 
impact of the underlying event and hadronization on our results. 
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1. Introduction 

Clarifying the role of Higgs bosons in the breaking of electroweak symmetry is of paramount 
importance in improving our understanding of elementary particle interactions. The discov- 
ery of the Standard Model Higgs boson, or its equivalent in theories beyond the Standard 
Model, is a principal objective of the high-energy collider experimental program. 

Now is a very exciting time in the Higgs search since the Tevatron experiments have 
become sensitive to potential signals from a Higgs boson with production cross-sections of 
roughly the magnitude predicted by the Standard Model (SM). Both CDF [1,2] and D0 [3] 
have presented studies where a cross-section about 1.5 — 1.7 times the SM prediction for a 
Higgs boson with a mass about twice the W-mass can be excluded by the two experiments 
independently with a confidence level of 95%. Recent preliminary combinations [4, 5] of 
the two experiments exclude a SM Higgs boson in the mass range 160 — 170 GeV with a 
95% confidence level. In this mass range, a signal is predominantly produced in the SM 
via the gluon fusion process pp — ► H — > WW — > llvv. 

Higher order calculations for the background processes and especially the signal cross- 
section are indispensable for the study or exclusion of high mass Higgs bosons at the 
Tevatron. The magnitude of QCD corrections for the dominant signal process is extraordi- 
narily large; the inclusive cross-section at next-to-next-to-leading order (NNLO) for gluon 
fusion is about three times larger than the leading order (LO) cross-section. This is an 
even larger if-factor than that obtained at LHC energies. The Tevatron experiments are 
sensitive to a Standard Model Higgs boson signal in the mass region where gluon fusion 
is dominant, due to this particularly large If -factor and the small theoretical uncertainty 
which is attained with calculations at higher orders in perturbation theory. 

Next-to-leading order (NLO) corrections have been computed in [8-10], where the 
anomalously large higher order effects in gluon fusion were first shown. The NNLO cor- 
rections for the inclusive cross-section have been computed in [11-13]. The NNLO com- 
putation has been consistently improved by resumming the soft-gluon contributions up 
to next-to-next-to-leading logarithmic (NNLL) accuracy [14]. The result is an additional 
increase of the cross section that amounts to about 13% at the Tevatron. The NNLL re- 
sult is nicely confirmed by the computation of additional soft terms to N 3 LO [15-17] (see 
also [18-20]). A calculation based on a somewhat different approach has been presented in 
Ref. [21]. 

The calculations of threshold effects have provided an invaluable argument that by 
now the bulk of the higher order corrections is accounted for. Smaller theoretical effects 
can also be important for setting a precise exclusion limit on the Higgs boson cross-section 
at the Tevatron. Two-loop electroweak corrections (of about ~ 5%) for gluon fusion Higgs 
production have been computed in [22-24] , and the full two-loop amplitude has been 
presented in [25,26]. Mixed QCD and Electroweak corrections have been computed in [27]. 
Recent predictions for the inclusive cross-section taking into account these effects have been 
presented in [27,28]; these theoretical results for the inclusive gluon fusion cross-section 
have been used in setting the exclusion limits of Ref. [5] . 

Electroweak corrections of 0(a) for the decay H — > WW — ► tvtv have been computed 
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in [29]. These reach 6% for the partial decay width of the decay of a Higgs boson to four 
leptons with mass of 170 GeV. In the region just above mn > 2M\y, which is relevant 
for the recent Tevatron exclusion limits, Ref. [29] reported important differences with the 
prediction of the program HDECAY [30]. These corrections have not yet been included 
into estimates of the Higgs branching ratio to leptons, and they are not considered in the 
experimental analysis of Refs. [4,5]. 

The extraction of limits on the Higgs boson cross-section from the study of the H — > 
WW — > tvtv process requires a sophisticated analysis. Background processes, such as direct 
WW, WZ, ZZ production as well as ti, Wj and W+ (multi)jet production, are dominant. 
Experimental cuts can suppress background and enhance the signal by vetoing hadronic 
radiation and by exploiting characteristic differences in lepton angular distributions as well 
as the large missing transverse momentum in signal events [31]. 

At the Tevatron, a cut-based analysis alone is not sufficient. Additional methods that 
exploit efficiently the kinematic features of signal and background processes in their finest 
detail are required. CDF [2] and D0 [3] apply cuts on missing energy and jet activity and 
impose lepton isolation criteria only for purposes of a first rough selection which biases 
the data samples towards signal events. After this first cut-based selection, background 
processes remain dominant and processing of real data and Monte-Carlo simulations with 
Artificial Neural Network (ANN) methods follows. It is easy to appreciate the importance 
of ANN techniques for setting exclusion limits at the Tevatron. For example, in a data 
sample analyzed passing first cut selection in Ref. [3], the data model employed there 
predicts only ~ 12 signal versus ~ 337 background e ± ^ T events. 

Given the sensitivity of the gluon fusion cross-section to higher order effects, it is impor- 
tant to establish that the sophisticated methods used in the Tevatron analysis [5] account 
for these effects within the estimated uncertainties. Already the first cut selection may 
change the relative importance of higher order corrections for signal and background cross- 
sections with respect to the perturbative patterns observed in the inclusive cross-sections. 
A complete simulation of the experimental analysis with NLO and NNLO perturbative 
corrections is not practically feasible. In this paper, we aim to provide precise predic- 
tions for the production cross section in conditions close to those of the actual experiments 
by combining knowledge of higher-order effects from fixed-order perturbation theory and 
parton shower event generators. 

First, we provide fixed-order predictions which are sufficiently detailed to permit as- 
sessment of the sensitivity of the acceptance due to selection cuts. For such a study, fully 
differential cross-sections of the signal at NNLO are required. Unlike NLO computations, 
NNLO differential calculations are a rarity due to their substantial technical complications. 
The first differential distribution at NNLO was computed in 2003 [32,33], and fully differen- 
tial NNLO cross-sections appeared in 2004 [34,35]. At an electron-positron collider NNLO 
differential cross-sections are known only for two [34,36] and three jet production [37-44] 
cross-sections. At hadron colliders fully differential cross-sections have been computed only 
for Higgs production in gluon fusion [35,45-48], and the Drell-Yan process [51-53]. 

In this paper we compute accepted cross-sections and kinematic distributions at NNLO 
using the programs FEHIP [45,47] and HNNLO [48]. We find an excellent agreement 



- 2 - 



between the predictions of the two programs. We remark that the methods used [46, 54] 
for constructing these fully differential NNLO programs are independent and very different 
in their conception. Both programs produce kinematic distributions in the form of bin 
histograms. All NNLO results for cross-sections and kinematic distributions calculated with 
both programs and presented here are in agreement within the expectations of statistical 
integration errors. The NNLO acceptance of the selection cuts can be directly compared 
with the predictions from the modeling of data as it has been performed by the CDF [2] 
and D0 [3] collaborations. The kinematic distributions we present here are input for their 
ANN analyses. 

Two selection cuts require special care in estimating their acceptance for the Tevatron 
studies: (i) a jet veto on two or more central jets and (ii) isolation of leptons from hadronic 
activity. These cuts are used in D0 [3] in order to define the entirety of the data sample 
and at CDF [2] that part of the sample with a potential Higgs boson signal. At CDF [2], 
a further division of the data sample into zero- and one-jet multiplicities is made. The 
relative magnitude of the perturbative corrections at NLO and NNLO with respect to LO 
(if-factor) is smaller after applying selection cuts. 

The same observation was made in earlier NNLO studies [47, 48] of the process pp — > 
H — ► WW — ► tutv at LHC center-of-mass energy of 14 TeV, with similar cuts, such as a jet- 
veto. In a separate paper [55], these NNLO predictions at the LHC were compared to results 
obtained from (%) a resummation of logarithms in the transverse momentum of the Higgs 
boson at NNLL accuracy [56] and (ii) the Monte-Carlo event generator MC@NLO [57]. 
They were found to be in good agreement with each other over the phase-space regions 
singled out by the event selection cuts. On the contrary, a fixed-order calculation at next- 
to-leading order (NLO) accuracy provided a rather poor approximation for the required 
distributions and cut efficiencies. 

In this paper we compare the selection cut acceptance and the shape of kinematic 
distributions for leptons at NNLO with the event generators MCONLO [57], HERWIG [58], 
and PYTHIA8 [59] for Tevatron collisions. 

The simulation of the gluon fusion process in Refs. [2,3], is performed with PYTHIA6 [60]. 
In these analyses, the uncertainty in the acceptance after selection cuts is estimated 
with other means rather than a direct NNLO calculation. In the CDF analysis [2], 
PYTHIA events are reweighted [61,62] to match either the (N)NLO Higgs or the NNLO 
Higgs rapidity spectrum [45,48]. The systematic uncertainty in the acceptance is computed 
from the differences between the original PYTHIA and the reweighted versions. In DO, 
acceptance uncertainties are estimated by comparing the spectrum of PYTHIA with that 
of other generators, such as MCONLO [57] and SHERPA [63]. We believe that neither of 
the two methods can substitute for a direct comparison with the acceptance at NNLO. We 
will discuss this point in Section 4. 

Multivariate techniques and distributions of ANN variables have so far been "terra 
incognita" for theoretical calculations at higher orders in perturbation theory. To the best 
of our knowledge, there has been no calculation of NLO and NNLO corrections for such 
observables. So far, the systematic uncertainty due to higher order effects on the shape 
of ANN distributions has been estimated indirectly. ANNs construct composite output 
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variables that maximize the differences between signal and background cross-sections. An 
error estimate on the shape of the ANN output distribution is obtained by varying the 
input kinematic distributions within their uncertainty range. However, if the input vari- 
ables have a proper definition at the parton level, there is no obstacle to computing the 
corrections directly at fixed order in perturbation theory in the same fashion as for any 
other simple partonic variable. This should provide a more reliable theoretical estimate of 
the uncertainty on the ANN distribution. 

We demonstrate such a calculation of an ANN output distribution through NNLO 
in this paper. We train an ANN with a PYTHIA simulated data sample that satisfies 
the selection in Ref. [1], which is a similar but somewhat simpler selection than that in 
Refs. [2,3]. As input we use kinematic distributions of the leptons in the final state. We 
have deliberately refrained from using variables such as the transverse momentum of the 
Higgs boson or jets, since these distributions may differ substantially in event generators, 
and they are not defined in their full physical range in fixed order perturbation theory. 
We compare the predictions of PYTHIA, MC@NLO and HERWIG for this ANN output, 
and find reasonable agreement if the observed discrepancies at the cut selection levels are 
already accounted for. 

The organization of the paper is as follows: We first present in Section 2 our predictions 
for the inclusive cross section at various orders of perturbation theory, then in Section 3 
we define the experimental observables to which selection cuts are applied. Section 4 is 
devoted to a discussion of the Higgs pt spectrum and of the jet multiplicities. In Section 5 
we give the results for the accepted cross section, i.e. after applying cuts on the various 
observables, and discuss the impact of the higher order corrections and scale variations on 
the selection efficiency. Next we present detailed comparisons of kinematical distributions, 
calculated at different orders of perturbative QCD (Section 6) and by using parton shower 
Monte Carlo models (Section 7). In Section 8 we compare for the first time fixed-order 
and parton shower Monte Carlo predictions for the output of an artificial neural network 
(ANN) similar to that used by the experimental groups. We comment on the stability and 
accuracy of these perturbative predictions, depending on the type of input variables to the 
ANN. Our conclusions are summarized in Section 9. 

2. Inclusive cross section 

Recent updates on the inclusive cross section for Higgs boson production at hadron colliders 
have been presented in Refs. [27,28]. 

In Ref. [27], the fixed order NNLO cross-section [11-13] is recomputed with MSTW2008 
parton densities [64]. Two-loop electroweak corrections from [25,26] and exact finite top 
and bottom mass effects [10,65] are included. Mixed QCD electroweak effects are taken into 
account by computing the relative magnitude of the correction with respect to the leading 
two-loop electroweak contribution by means of an effective theory. The central value of the 
cross-section is determined at a factorization and renormalization scale = mn/2. The 
scale variation error for Higgs mass values 160 — 170 GeV in this fixed order calculation is 
+7%, —11%. The corresponding parton density error is ~ ±10%. 
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In Ref. [28], the NNLL calculation [14] with the appropriate matching to the fixed 
order NNLO result [11-13] is repeated using the MSTW2008 parton densities [64]. Two- 
loop electroweak corrections from [25,26] and exact finite top and bottom mass effects [10] 
are included. Mixed QCD electroweak effects are taken into account by multiplying the 
inclusive two-loop electroweak contribution [25, 26] with the QCD iGfactor. The central 
value of the cross-section is determined at a factorization and renormalization scale n = mn- 
The scale variation error for Higgs mass values 160 — 170 GeV in this resummed calculation 
is +9%, —8%. The corresponding parton density error is ~ ±8%. 

The central values of the cross-sections in Refs [27, 28] agree within a percent, and 
they have been used in setting the exclusion limits in [5]. CDF assigns a global theoretical 
uncertainty on the inclusive cross-section of ±12% [2], adding in quadrature a ±11% scale 
variation error and a ±5% parton density error. In our opinion, the combination of the 
two errors in quadrature requires further justification. D0 assigns a somewhat smaller 
theoretical uncertainty of ±10% to the inclusive Higgs boson cross-sections. Even justifying 
a combination in quadrature of the uncertainties from parton densities and scale variations, 
the uncertainties on the total rate appear underestimated, compared to those of Refs. [27, 
28]. 

The main goal of the present paper is to assess the theoretical uncertainties on the 
shapes of kinematic distributions and the acceptance after selection cuts. We have used 
MRST2001 PDFs at LO and MRST2004 PDFs at NLO and NNLO. All the fixed-order 
results in this paper have been obtained independently with the FEHiP [35,45,47,55] and 
HNNLO [46,48] programs, by first calculating iGfactors in the limit of a very heavy top- 
quark and then by multiplying these K-factors with the exact leading order gluon fusion 
cross section for Higgs production via a top-quark (ignoring bottom contributions). The 
total width is computed using the program HDECAY [30]. 0(a) electroweak corrections 
for the Higgs partial decay to leptons [29] have also been ignored. This is justified for 
studies of shapes and acceptances. 



O-inc [fb] 


LO 


NLO 


NNLO 


^NLO 


^-NNLO 


H = m H /2 
H = m H 
/x = 2 mn 


1.998 ±0.003 
1.398 ±0.001 
1.004 ±0.001 


4.288 ± 0.004 
3.366 ± 0.003 
2.661 ±0.002 


5.252 ±0.016 
4.630 ± 0.010 
4.012 ± 0.007 


2.149 ±0.008 
2.412 ± 0.002 
2.651 ± 0.008 


2.629 ± 0.009 
3.312 ±0.008 
3.996 ± 0.008 



Table 1: Inclusive cross sections for mn = 160 GeV, at various orders in perturbation theory and 
for different scale choices. The iVfactors are defined in the text; LO=Lcading order. 

The cross-section for pp — ► H — > WW — ► £u£u, with no cuts applied and using the 
setup described in the last paragraph, is presented in Table 1 for illustration. We have 
chosen to study the signal cross-section for a Higgs mass value mu = 160 GeV, and we 
vary the renormalization (/xr) and factorization (//p) scales simultaneously in the interval 
ji = zip = jUR G [mn/2, 2 ran]. In the same Table, we also present the K-factors for the 
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inclusive cross section, 

A .«N,NLO (fl) = (21) 

The perturbative corrections are very large, and there is still a substantial 23—50% increase, 
depending on the scale choice, of the cross-section at NLO upon including the NNLO 
corrections. The smallest if-factors occur for smaller values of the renormalization and 
factorization scale. 

3. Observables and selection cuts 

The very sophisticated and complex experimental analyses of the two Tevatron collabora- 
tions cannot be reproduced fully at a perturbative level. In addition, there are important 
differences between the CDF and D0 analyses, both at the event selection level and in the 
usage of ANNs, and they evolve with time and the acquisition of more data. Nevertheless, 
we believe the analysis presented here captures the essential features that allow us to study 
the likely sensitivity of the results to higher order effects. 

Our particle and event selection follows along the lines of the CDF analysis of Ref. [1] 
and proceeds through the following steps: 

1. Lepton selection: in the CDF experiment, the experimental acceptances for electrons 
and muons are different. In this publication, we only consider the final state with 
two muons, simulating only the muon acceptance. The differences in the CDF cuts 
for muons and electrons are rather geometric, and should not alter the convergence 
pattern of the perturbative corrections. In our analysis, one of the final-state leptons 
(the 'trigger lepton') must have a transverse momentum j>t > 20 GeV and pseudo- 
rapidity \rj\ < 0.8. In order to pass a further lepton selection, a second lepton must 
be found with > 10 GeV and < 1.1. 

(a) Two opposite-sign leptons have to be found, fulfilling the requirements discussed 
above. 

(b) Both leptons have to be isolated, i.e. the additional transverse energy in a cone 
with radius R = 0.4 around the lepton has to be smaller than 10 % of the lepton 
transverse momentum. 

(c) In order to reduce the background from b resonances, the invariant mass of the 
lepton pair has to be mu > 16 GeV. 

2. We define the missing transverse energy (MET) as the vectorial sum of the transverse 
momenta of the two neutrinos. We define the variable MET* as 

f MET , 6 > tt/2 

MET* = < , , 3.1) 

\ MET x sm<p , 4> < tt/2 ' v ; 

where (ft is the angle in the transverse plane between MET and the nearest charged 
lepton or jet. We require MET* > 25 GeV, which suppresses the background from 
Drell-Yan lepton pairs and removes contributions from mismeasured leptons or jets. 
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3. In order to suppress the tt background, we apply a veto on the number of jets in the 
event. Jets are found using the A:T-algorithm [49, 50] with parameter R = 0.4. A jet 
must have pr > 15GeV and |ry| < 3.0. Events are only accepted if there is no more 
than one such jet. 

The jet veto that we apply here is different from that used in our corresponding LHC 
studies [47,48,55], where all events with any number of central jets with a pt higher than 
a certain minimum value are vetoed. The cuts in the present study allow for events with 
a single high-px jet. This type of jet veto is used in the D0 analysis [3] in order to define 
the data sample with a potential Higgs signal. A stricter jet veto is applied in the CDF 
analysis [2], where three data samples are defined according to whether events have zero, 
one, or more central jets. 

4. Higgs pr spectrum and jet multiplicities 



pp^H+X pp^H+X 




20 40 60 80 100 20 40 60 80 100 



p? (GeV) p? (GeV) 



Figure 1: On the left figure, we show the normalized transverse momentum distributions for 
Tor = 160 GeV and /i = /ir = [if = tuh using NNLO fixed-order perturbation theory and the re- 
summed calculation of Ref. [56]. On the right figure, the same distribution is shown for MC@NLO, 
PYTHIA8, and the calculation of Ref. [56]. 

One of the most important distributions for a Higgs boson produced at hadron colliders 
is its transverse momentum spectrum. A good description of the spectrum implies a 
good understanding of the QCD radiation recoiling against the Higgs. 

It is well known that the Higgs px spectrum is not physical when computed at fixed 
order, since it diverges to +oo or — oo at any fixed order in a s . When pr <C mu large 
logarithmic contributions of the form a™ ln m mn/pT appear that must be resummed to all 
orders. In Ref. [56] the resummation of these logarithmically enhanced terms has been 
performed analytically up to NNLL accuracy, and the result has then been matched to the 
fixed order calculation up to O(af). The integral of the ensuing spectrum coincides with 
the total NNLO cross section. In Fig. 1 (left) we compare the normalized pi spectrum 
of the Higgs computed at fixed order, to the one obtained with the numerical program of 
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Ref. [56]. We see that the fixed order calculation diverges to — oo as pr — ► 0. The resummed 
calculation is instead well behaved as px — > 0. We also note that the two calculations are 
in good agreement for px larger than about 30 GeV. 

Standard Monte Carlo event generators effectively perform the transverse-momentum 
resummation, and thus obtain a well behaved px spectrum. A comparison of the px spectra 
from PYTHIA and MC@NLO to the resummed calculation is presented in Fig. 1 (right). 
We notice that the PYTHIA spectrum is softer than those of MC@NLO and the NNLL 
calculation. This difference is more marked at LHC energies [66]. 

We now comment on the procedure used by CDF and D0 to estimate the systematic 
uncertainties on the signal acceptance. The simulation of the gluon fusion process in 
Refs [2,3], is performed with PYTHIA [60]. In these analyses, the uncertainty in the 
acceptance after selection cuts is estimated with different methods. In the CDF analysis [2], 
PYTHIA events are reweighted [61,62] to match either the (N)NLO Higgs px or the NNLO 
Higgs rapidity spectrum [45,48]. The systematic uncertainty in the acceptance is computed 
from the differences between the original PYTHIA and the reweighted versions. In D0, 
acceptance uncertainties are estimated by comparing the spectrum of PYTHIA with that 
of other generators, such as MC@NLO [57] and SHERPA [63]. We believe that neither of 
the two methods can substitute for a direct comparison with the acceptance at NNLO. 

The reweighting technique used by CDF is based on the NNLO pt spectrum of the 
Higgs boson. As shown in Fig. 1, the fixed-order Higgs px spectrum is divergent as px — ► 0. 
A reweighting procedure based on the NNLO Higgs px spectrum makes sense only if the 
signal rate is integrated over a reasonably large pt region (as is done, for example, in 
Ref. [62]). In this respect, it would be better to reweight using a resummed calculation [56], 
or to use different kinematic variables. 

Note also that a reweighting based on the Higgs px and rapidity is insensitive to the 
jet multiplicity of the event, which is used in order to divide the data sample. Another 
important aspect is that the relative weight of the one- and two-jet sample is enhanced by 
the CDF cuts, with respect to the zero-jet sample. If a jet is required in all events, the 
0(a\) calculation includes matrix elements through NLO only. If two jets are required, 
only LO matrix-elements are taken into account. An NLO prediction for 1-jet samples 
and a LO prediction for 2-jet samples 1 can be obtained with FEHIP and HNNLO using 
the corresponding NLO and LO a s and parton densities. More importantly, we find it 
inconsistent to use the theoretical uncertainty from the inclusive NNLO gluon fusion cross- 
section as the uncertainty of the samples with defined jet multiplicities other than zero. 

To demonstrate this point further, we follow an analogous procedure as in [2] and 
divide the signal cross-section into three bins according to the number of central jets. 
Jets are defined using the /cx-algorithm, with minimum px = 15 GeV and maximum 
rapidity \r/\ < 2. We compute the inclusive cross-sections for toh = 160 GeV and vary 
the renormalization and factorization scale simultaneously in the interval [mn/2, 2mn]. In 
Table 2 we compute the three bin cross-sections, using either NNLO or NLO or LO parton 
densities and a s evolution from the recent MSTW2008 fit [64]. 

1 An NLO calculation for H + 2 jets via gluon fusion can be found in [67]. 
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a[fb] 


LO (pdfs, a s ) 


NLO (pdfs, a s ) 


NNLO (pdfs, a s ) 


O-jets 


^ 4^9+ 7% 

^• 40/ -io% 


2.883+^ 


9 7f)7 +5% 


1-jet 


1 7c;o+307c 

i.ioz_ 26% 


1.280™ 


1 165 +24% 
1.1D0_ 22% 


> 2-jets 






n 1 Qfi+ 78% 



Table 2: Inclusive cross sections in the different jet bins. 

The total cross-section with NNLO pdfs varies around the default scale value [i = mu 
by ±14%. From Table 2 we see that about 66.5% of the events contain zero jets, 28.6% one 
jet only, and 4.9% contain more than one jets. Notice, however, that the scale variation in 
the three jet bins is significantly different and deteriorates with increasing jet multiplicity. 
This is a consequence of the fact that in the 1-jet and 2-jet bins the fixed order calculation 
is only accurate through NLO and LO, respectively. The resulting scale dependence of the 
inclusive cross section is made up as follows: 

A-/V; nc (scale) ,. ( , r:<V / +- 1 i I n<Y M~ ■ i — I r ' " • I i |) 



66.5% • + 28.6% • (+**) + 4.9% • (+J*) = ^ 



The application of different selection cuts in the three jet bins leads to a theoretical 
error estimate of the number of signal events which is different from the theoretical error 
of the inclusive NNLO cross-section. Specifically, from Tables 1-3 of Ref. [2] we observe 
that, after preselection, 60% of gluon fusion events belong to the 0-jets bin, 29% to the 
1-jet bin, and 11% to the 2-jet bin. 

We now examine how this modification of the jet multiplicities with the experimental 
cuts affects the scale variation for the total number of events. With the exception of the 
jet-veto, all other cuts used in the CDF preselection [2] do not affect the scale variation of 
the total cross-section significantly. We can then estimate the scale variation of the total 
number of signal events using the scale- variations for each jet-multiplicity in Table 2 and 
the expected composition of jet-multiplicities for the signal [2]. Using NNLO pdfs and 
NNLO a s evolution for all jet bins, we find that: 

^signal 

The resulting scale variation is therefore larger than the corresponding scale variation of 
±14% for the inclusive cross-section. 

Notice that in Eq. 4.2 we used a scale variation for the one-jet and two-jet bins cor- 
responding to NNLO pdfs and a s evolution. A more consistent approach would be to 
estimate the number of events in the 1-jet and 2-jet bins using NLO and LO pdfs and a s 
evolution correspondingly. In this way we obtain: 



'Jf^ = 60 % ■ + 29% • (;») + 11% ■ (ijg) = («S) (4.2) 



AiVsignal (scale) (+5%\ , 9q% f+24%\ /+91%\ _ (+20.0% 
__ _ bO/o • ^_ 9% J + 29/o • ^_ 23% J + 11/0 • ^_ 44% J - ^_16. 9% 



(4.3) 



The relative population of the jet bins is very important for the determination of 
the theoretical error on the total number of events. The contribution of the different jet 
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multiplicities to the total error can be altered also after the preselection cuts, since, in 
general, the independent multivariate methods for discriminating signal from background 
events in various jet bins should have different discriminating efficiency. In conclusion, 
the theoretical error for the number of events at various jet multiplicities should not be 
estimated collectively from the scale variation of the total cross-section. 

5. Signal cross section and preselection efficiency at fixed order 



We present in Table 3 the LO, NLO, and NNLO cross sections, as obtained with FEHIP 
and HNNLO after applying the selection cuts in Section 3, for a default Higgs boson mass 
value mu = 160 GeV. 



0"acc [fb] 


LO 


NLO 


NNLO 


^NLO 


^-NNLO 


\x = m H /2 
H = m H 
[i = 2 ran 


0.750 ±0.001 
0.525 ±0.001 
0.379 ±0.001 


1.410 ±0.003 
1.129 ±0.003 
0.903 ± 0.002 


1.459 ± 0.003 
1.383 ± 0.004 
1.242 ± 0.001 


1.880 ±0.005 
2.150 ±0.007 
2.383 ± 0.008 


1.915 ±0.025 
2.594 ±0.052 
3.261 ±0.048 



Table 3: Accepted cross sections and ^-factors after the application of all the selection cuts for 
to h = 160 GeV. 

Comparison of the results of Table 3 with those of Table 1 shows that the impact of 
QCD radiative corrections is significantly reduced when selection cuts are applied. Indeed, 
for up = fiji = rriH the NLO and NNLO i^-factors are reduced by 11% and 22%, respec- 
tively As a consequence, the acceptance is also reduced, since it is defined as the ratio of 
the cross-section after cuts to the inclusive cross section. At LO about 38% of the events 
are accepted. At NLO, the efficiency drops to 33% - 34% and at NNLO to 28% - 31%, 
depending on the scale choice. 

An important observation is that the scale dependence of the efficiency becomes 
stronger when we increase the perturbative order. The lepton isolation and jet-veto cuts 
do not change the LO cross section, since the isolation requirement gives a non-vanishing 
contribution only at NLO and the veto on the number of central jets is effective only beyond 
NLO. We will comment further on the jet-veto later when comparing to the Monte-Carlo 
event generators, PYTHIA, HERWIG and MCQNLO. 

The increased scale variation of the acceptance at NNLO is a reflection of the reduction 
of the scale variation for the cross section after cuts are applied. In Fig. 2, we present the 
cross section as a function of the renormalization and factorization scale fi = (1r = [xf, 
before and after cuts, at LO, NLO, and NNLO. Before cuts, the NNLO cross section varies 
by 27% over the interval [tjih/2, 2mn]. This drops to a 16% scale variation after cuts, and 
consequently the ratio varies by ~ 11%. We note that the scale variation in a different 
but also commonly used range [mn/4, mi] is 20% for the inclusive cross-section and 6% 
for the accepted cross-section after cuts. In this second scale variation range, the accepted 
cross-section is not monotonic and develops a maximum. 
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Figure 2: Cross sections for mn = 160 GeV as a function of the scales /i — /ir — /if for different 
orders in fixed-order perturbation theory. The solid lines represent inclusive cross sections, the 
dashed lines cross sections after the application of the selection cuts. 

6. Kinematic distributions at fixed order perturbation theory 

We now turn to a more detailed study of the kinematical properties of the accepted events. 
The applied cuts provide a rough discrimination of the Higgs signal over the background. 
The experimental analysis proceeds further by exploiting the differences in the kinematical 
distributions between signal and background and is typically based on Artificial Neural 
Network (ANN) techniques. It is thus important to check that these distributions are 
stable against radiative corrections. 

In Fig. 3 we present a set of distributions that are commonly used in the experimental 
analysis, computed at LO, NLO and NNLO. The uncertainty bands are obtained by varying 
i^F = between m#/2 and 2mu- Figures 3(a) and (b) show the transverse momentum 
spectrum of the leading and trailing lepton, ptp rd and p™ ft . Figure 3(c) shows the invariant 
mass distribution of the lepton pair, ma, Fig. 3(d) shows the MET distribution and, finally, 
Fig. 3(e) shows the azimuthal separation of the two charged leptons in the transverse plane, 
4>u. Overall, these plots show that the distributions are quite stable when going from NLO 
to NNLO. The NNLO band generally lies on top of the NLO band and nicely overlaps 
with the latter. The scale uncertainty of the distributions is consistent with that quoted 
in Table 3 and is about ±(6 — 8)% in the peak region of the distributions. 

We also observe that the perturbative corrections from NLO to NNLO are smaller 
when the scale \x = ma/ 2 is used. 
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Figure 3: Kinematic distributions obtained at LO, NLO and NNLO in perturbative QCD. Shown 
are the transverse momentum of the leading (p!p ald ) and trailing lepton (p^? ft ), the invariant mass 
of the two leptons (to/;), the transverse missing energy (MET) and the azimuthal separation of the 
two charged leptons in the transverse plane (4>u). 



7. Comparison with parton shower event generators 



In the next step we compare the accepted cross-section and kinematic distributions after 
selection cuts, obtained in fixed order perturbation theory, with the predictions of the 
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parton shower event generators HERWIG, PYTHIA and MC@NLO. We consider this as 
a very important validation for the current studies at the Tevatron, which rely on event 
generator predictions. 



face [fb] 


H = m H /2 


H = m H 


H = 2 mu 


LO 

R LO (HERWIG) 
R LO (PYTHIA) 


0.750 ± 0.001 
0.571 ± 0.001 
0.483 ± 0.001 


0.525 ± 0.001 
0.399 ± 0.001 
0.338 ± 0.001 


0.379 ± 0.001 
0.287 ± 0.001 
0.245 ± 0.001 


NLO 

MC@NLO 
R NLO (HERWIG) 
R NLO (PYTHIA) 


1.410 ± 0.003 
1.143 ± 0.002 
1.225 ± 0.001 
1.037 ± 0.002 


1.129 ± 0.003 
0.906 ± 0.003 
0.962 ± 0.001 
0.814 ± 0.002 


0.903 ± 0.002 
0.718 ± 0.002 
0.760 ± 0.001 
0.645 ± 0.001 


NNLO 

i? NNLO (MC@NLO) 
i? NNLO (HERWIG) 
^>NNLO (PYTHIA) 


1.459 ± 0.003 
1.399 ± 0.003 
1.501 ± 0.001 
1.271 ± 0.002 


1.383 ± 0.004 
1.246 ± 0.003 
1.323 ± 0.001 
1.120 ± 0.002 


1.242 ± 0.001 
1.082 ± 0.003 
1.146 ± 0.001 
0.971 ± 0.002 



Table 4: Comparison of the accepted cross-section obtained in fixed order perturbation theory 
and from various event generators. The predictions from event generators are rescaled such that 
the predictions for the inclusive cross-section are in agreement with the corresponding fixed order 
result. 

In Table 4 we present the predictions of the fixed order calculations through NNLO, 
as well as the predictions of PYTHIA, HERWIG and MC@NLO. At this stage we do 
not include a simulation of hadronization and the underlying event, in order to make a 
more direct comparison with the fixed order predictions, which cannot take into account 
such effects. The total inclusive cross-section of PYTHIA and HERWIG corresponds to 
a pure LO computation, and that of MC@NLO is correct at NLO accuracy. Since we 
are interested in a comparison of the efficiencies and not of absolute cross-sections, we 
multiply the results of the event generators with appropriate scaling factors, such that they 
match the result of the total inclusive cross-section given by our fixed order computation. 
Because of the different parton distribution functions and approximation for the top loop 
used in the various calculations, these scaling factors to match the leading fixed order 
calculations [45,48] are different from unity at the level of 5%. 

After applying the selection cuts, we find a relatively good agreement among the 
NNLO, MC@NLO and HERWIG results for the accepted cross-sections, with the MC@NLO 
and HERWIG predictions rescaled to reproduce the fixed order NNLO inclusive cross sec- 
tion before cuts, as explained above. The MC@NLO result is smaller than the NNLO 
prediction by 4 — 14%, depending on the scale choice. HERWIG results differs from the 
NNLO prediction by —3% to +8%. On the contrary, the accepted cross section and con- 
sequently the selection efficiency obtained with PYTHIA appears to be somewhat smaller. 
Depending on the scale choice the difference ranges between 12 and 21%. 

For a better understanding of the results of Table 4, we now analyze the efficiency of 
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individual cuts, applied in turn. The results are given in Table 5, where the efficiencies 
due to a specific cut only (after all previous cuts have been applied) are presented between 
parentheses. We observe the following: 



&&cc/ Cmcl 


Trigger 


+ Jet- Veto 


+ Isolation 


All Cuts 


NNLO (/x = m H /2) 


44.7% 


39.4% (88.1%) 


36.8% (93.4%) 


27.8% (75.5%) 


NNLO (fi = 2 m H ) 


44.9% 


41.8% (93.1%) 


40.7% (97.4%) 


31.0% (76.2%) 


MCQNLO (fj, = mu/2) 


44.4% 


38.1% (85.8%) 


35.3% (92.5%) 


26.5% (75.2%) 


MCQNLO (n = 2 m H ) 


44.8% 


38.8% (86.7%) 


35.9% (92.5%) 


27.0% (75.2%) 


HERWIG 


46.7% 


40.8% (87.4%) 


37.8% (92.7%) 


28.6% (75.7%) 


PYTHIA 


46.6% 


37.9% (81.3%) 


32.2% (85.0%) 


24.4% (75.8%) 



Table 5: Comparison of the predicted selection efficiency after successive application of cuts, 
as obtained by fixed order calculations and event generators. Between parentheses we give the 
efficiency due to a single cut, after all previous cuts have been applied. The event generator 
predictions correspond to the parton level only, i.e., no hadronization and underlying event effects 
are included. The first column lists the lepton selection ("trigger") efficiencies, the second (third) 
columns give the results when also the jet-veto (isolation) cuts are applied in sequence and the last 
column lists the results after applying all remaining cuts. 



• When only the cuts for lepton selection are applied ("Trigger"), we generally find 
very good agreement of all calculations for the corresponding efficiency. In de- 
tail, MC@NLO and NNLO yield almost identical efficiencies, while HERWIG and 
PYTHIA give a slightly higher efficiency. 

• The veto on two or more central jets is a rather critical cut for the achievable ac- 
curacy of the efficiency estimation. This "preselection" cut has been used in the 
recent Tevatron studies. The efficiency of the jet- veto alone, after trigger cuts, varies 
significantly at NNLO by about 5%. As discussed in Section 5, this is due to the 
larger scale variation of the inclusive NNLO cross-section, while at the same time the 
accepted cross-section after the jet-veto application is more stable. For fj, = mn/2, 
where the fixed order expansion demonstrates a faster convergence, we find that the 
jet-veto efficiency at NNLO is in very good agreement with MC@NLO and HERWIG. 
PYTHIA, which is the main tool used in the Tevatron analysis, predicts an efficiency 
which is smaller by 6% for \i = mu/2. 

• PYTHIA also predicts a smaller efficiency by about 8% for the isolation cut. HERWIG, 
MC@NLO and the NNLO calculation are consistent, taking into account scale vari- 
ations. 

• The efficiency of the remaining cuts is very similar in all computations. After pres- 
election, PYTHIA, rescaled with an inclusive NNLO If -factor, predicts between 12 
and 21% less signal events than the NNLO computation. 
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The sensitivity of event generator predictions to cuts that restrict hadronic activity 
requires careful investigation. In particular, it is important to study the effect of hadroniza- 
tion and the underlying event on the efficiency. We have performed such an analysis in the 
case of MC@NLO, where hadronization effects are modeled by HERWIG and the under- 
lying event is simulated by interfacing to the JIMMY package [68], and also for PYTHIA, 
where the hadronization and underlying event modelling is inbuilt. The results are given 
in Table 6. We find only minor differences at the 1% level, which leads us to the conclusion 
that the differences observed with PYTHIA are rather related to its matrix element and 
parton shower implementation. 



Cacc/^mcl 


Trigger 


+ Jet- Veto 


+ Isolation 


All Cuts 


MC@NLO (Parton) 


44.4% 


38.1% 


35.3% 


26.5% 


MC@NLO (Hadron) 


44.6% 


39.6% 


37.4% 


28.3% 


MC@NLO (Had + UE) 


44.6% 


39.4% 


36.5% 


27.6% 


PYTHIA (Parton) 


46.6% 


37.9% 


32.2% 


24.4% 


PYTHIA (Had + UE) 


44.5% 


37.2% 


31.3% 


23.8% 



Table 6: Selection efficiency obtained with MC@NLO {n = mjy/2) and PYTHIA after successive 
application of cuts. The first row gives the results for the purely partonic MC@NLO simulation, 
in the second it is complemented by the cluster hadronization model as implemented in HERWIG, 
and in the third an underlying event model (JIMMY) is added to the simulation. For PYTHIA 
the inbuilt hadronization and underlying event models are applied simultaneously. 



8. Artificial Neural Network 

The current experimental analysis at the Tevatron attempts to distinguish a very small 
number of events from a considerably larger background. For such a task, the use of 
advanced statistical methods is necessary. An integral part of the experimental studies are 
distributions of discrimination variables, defined via artificial neural networks (ANN). It is 
clear that such techniques will become an indispensable tool in many future studies at the 
Tevatron and at the LHC, optimizing the sensitivity of the experiments. 

To the best of our knowledge, so far there has been no study of how the distributions 
of ANN outputs are modified at higher orders in perturbation theory. Here we present for 
the first time an ANN output distribution, computed at fixed order in perturbation theory, 
beyond the leading order. 

In order to study these higher-order effects on the outcome variable, we have built 
an ANN with the tool TMVA v. 3.9.2. [6] based on the Data Analysis Framework ROOT 
v. 5.21.02 [7]. In the construction (the so-called training) of the ANN the user has to 
provide a set of signal and background events, as well as a list of input variables. In our 
study we use the variables defined in Section 6 as input variables. Based on the techniques 
of Multilayer Perceptrons, the ANN then builds an output variable, which is basically 
a non-linear function of the input variables. Since our study is based on Monte Carlo 
truth information, we restrict the set of background to processes that have the same final 
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Figure 4: Kinematic distributions obtained at NNLO in perturbative QCD and with PYTHIA. 
The PYTHIA results are rescaled by an inclusive if-factor in order to reproduce the inclusive cross 
section at NNLO. Shown are the transverse momentum of the leading (p! r ard ) and trailing lepton 
(j>T ft )> the invariant mass of the two leptons (mu), the transverse missing energy (MET) and the 
azimuthal separation of the two charged leptons in the transverse plane (4>u). 



state signature as the signal, i.e. to continuum WW background and top-pair production. 
For both of these processes, as well as for the signal process, we generate a large enough 
event sample (~ 50000 events after pre-selection) with the LO parton-shower Monte Carlo 
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Figure 5: Kinematic distributions obtained at NNLO in perturbative QCD and with MC@NLO. 
The MC@NLO results are rescaled by an inclusive if-factor in order to reproduce the inclusive 
cross section at NNLO. Shown are the transverse momentum of the leading (p^ ald ) and trailing 
lepton (j>T ft )j the invariant mass of the two leptons (mu), the transverse missing energy (MET) 
and the azimuthal separation of the two charged leptons in the transverse plane (4>u). 

PYTHIA8 [60], and use half of the samples to train the ANN. The other half is used in the 
so-called testing step. 

Fig. 6 shows the ANN output for the signal and background samples, for the training 
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Figure 6: Distribution of the ANN output variable coming from the ANN construction procedure. 
On the left, the distribution for the training sample, on the right for the testing sample. 

(left) and the testing (right) samples. The comparison of the training and testing dis- 
tributions serves as a verification that the ANN has not been over-trained, i.e. tuned to 
statistical effects in the training sample. The discrimination power of the ANN variable can 
be seen clearly. While the background distribution peaks at low values, 2 the signal events 
populate the high value range. Distributions like this can serve to distinguish background 
from possible signal events, either by cutting on the ANN output, or by using the shape of 
the distribution to decide whether the observed event set consists of background only or 
background and signal events. 

In Fig. 7 we present the distribution of the ANN output for the signal in fixed-order 
perturbation theory, computed at LO, NLO and NNLO. We find significant radiative cor- 
rections, which however are consistent in magnitude with those for the accepted cross 
section after preselection cuts. 

In Fig. 8 we compare the ANN distribution obtained at NNLO and with MC@NLO. 
Again we find a very good agreement between the fixed order calculation and the MC@NLO 
prediction, when the latter is rescaled with a .fT-factor in order to reproduce the total 
inclusive cross-section. 

Finally, in Fig. 9 we compare the ANN distribution obtained at NNLO QCD and with 
PYTHIA. We see that PYTHIA, even after rescaling with an inclusive i^-factor, yields 
predictions which are smaller by 12-20%, depending on the chosen bin. This difference can 
be traced back to the difference in efficiency already observed at the level of the selection 
cuts placed on the kinematic input distributions. 

Note that we have not included any hadronic variable as an input to the ANN. It is 
clear that stable perturbative patterns are obtained as long as we apply cuts on "leptonic" 
variables only. However, adding a hadronic variable to the list of ANN inputs could produce 
results that are very sensitive to the details of the modeling of the hadronic activity in the 
event generators used for the training of the network. 

2 The background peaks around —1.0 and —0.3 arise mainly from ti and WW respectively. 
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Figure 7: Distribution of the ANN output variable, computed at LO, NLO and NNLO in pertur- 
bation theory. The bands indicate the scale uncertainties. 

9. Conclusions 

In this paper we have studied higher-order QCD effects in the search for a Higgs boson of 
mass mn = 160 GeV at the Tevatron. We have considered a definite set of preselection 
cuts that we believe capture the essential features of the CDF and D0 analyses. We have 
studied the impact of higher order corrections on a set of kinematical distributions of the 
final state leptons. We have then compared these distributions, computed up to NNLO in 
QCD perturbation theory, to those obtained with the PYTHIA, HERWIG and MC@NLO 
event generators. The comparison of distributions does not show significant differences, 
and this is confirmed by a more sophisticated analysis we have performed based on the 
training of our own ANN. 

For the ANN analysis we used only leptonic input variables, in order to reduce sensitiv- 
ity to the modelling of hadronic activity and to allow perturbative evaluation of the output 
distribution. These features are also necessary for the reliable estimation of theoretical 
uncertainties in experimental ANN analyses. 

We have also compared the efficiency of the experimental cuts obtained in NNLO 
QCD to those obtained with PYTHIA, HERWIG and MC@NLO. The efficiencies obtained 
with HERWIG and MCONLO are consistent with that obtained at NNLO. The MC@NLO 
acceptance is slightly smaller than the NNLO acceptance, by 4 — 14%, while the acceptance 
of HERWIG differs from the NNLO prediction by —3% to +8%. In contrast, we find 
that the acceptance computed with PYTHIA is between 12% and 21% smaller than the 
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Figure 8: Distribution of the ANN output variable, computed at NNLO in perturbation theory and 
with MC@NLO. The MC@NLO result is rescaled by an inclusive if-factor in order to reproduce 
the inclusive cross section at NNLO. The bands indicate the scale uncertainties. 



NNLO acceptance, depending on the choice of the factorization and renormalization scale. 
This result is not significantly altered by hadronization and underlying event and appears 
instead to be related to the matrix element and parton shower implementation in PYTHIA 
itself. Since the Tevatron analyses are based on PYTHIA, we believe that this effect could 
be important and requires a more detailed investigation within the framework of the full 
experimental analysis. 

Relevant to the experimental analysis, we have remarked that the combination in 
quadrature of the theoretical errors due to the parton distributions and scale variations 
in Refs. [2,3] implies that the theoretical uncertainty on the total cross section used there 
is likely to be underestimated. We also pointed out that a reweighting of parton shower 
Monte-Carlos to match the fixed-order Higgs px distribution is not appropriate for events 
with a low Higgs px value. Finally, we have demonstrated that a reliable estimation of the 
theoretical uncertainty for Higgs signal cross-sections with defined jet multiplicities requires 
dedicated fixed order computations for each multiplicity. The theoretical uncertainty in 
each jet-bin is different from the theoretical uncertainty of the total cross-section. 
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Figure 9: Distribution of the ANN output variable, computed at NNLO in perturbation theory 
and with PYTHIA. The PYTHIA result is rescaled by an inclusive JC-factor in order to reproduce 
the inclusive cross section at NNLO. The bands indicate the scale uncertainties. 
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